ABSTRACT

A NOTE ON CONVERGENCE OF FUNCTIONS OF RANDOM ELEMENTS

Convergence of $g(\xi_n)$ to $g(\xi)$ is considered when $\xi_n \to \xi$ in distribution or in probability, without the usual restriction that $g$ be continuous a.s. under the distribution of $\xi$. It is shown that the convergence holds for arbitrary Borel-measurable $g$, if in addition to the assumed convergence the corresponding measures $\{P_n\}$ of $\{\xi_n\}$ are "contiguous" to the measure $P$ of $\xi$ in a certain very weak sense. Some statistical applications are indicated.
1.  Introduction and preliminaries.  Many statistical applications of probability theory involve a question of convergence of $g(\xi_n)$ to $g(\xi)$, $n \to \infty$, where $g$ is a given function applied to a sequence of r.v.'s $\xi_n$ convergent in some sense to $\xi$. Typically, the discontinuity set of the function $g$ is required to have measure 0 with respect to the distribution of $\xi$. (See, e.g., Billingsley (1968), §5, and Serfling (1980), §1.7.) However, in some cases it is of interest to relax this assumption on $g$. Here it is shown that $g$ may be an arbitrary Borel-measurable function, if in addition to the assumed convergence $\xi_n \to \xi$ the corresponding measures $\{P_n\}$ of $\{\xi_n\}$ are contiguous to the measure $P$ of $\xi$ in a certain very weak sense.

We will consider $\{\xi_n\}$ and $\xi$ to be random elements of a metric space $(S,\rho)$, and we will assume that $S$ is separable (in order that $\rho(\xi_n,\xi)$ be a well-defined random variable and that $P$ be regular) and locally compact (in order that Luzin's theorem be applicable). We shall treat convergence of $\xi_n$ to $\xi$ in distribution, $\xi_n \xrightarrow{d} \xi$ (by which we mean that the distributions $P_n$ converge weakly to $P$, $P_n \Rightarrow P$), and in probability, $\xi_n \xrightarrow{p} \xi$ (by which we mean that $\{\xi_n\}$ and $\xi$ are defined on a common probability space $(\Omega,\mathcal{A},\mathbb{P})$ and that $\mathbb{P}\{\rho(\xi_n,\xi) > \varepsilon\} \to 0$, $n \to \infty$, each $\varepsilon > 0$). As is well known (Billingsley (1968), p. 26), $\xi_n \xrightarrow{p} \xi$ implies $\xi_n \xrightarrow{d} \xi$.

Let $(X,\mathcal{A})$ be an arbitrary measurable space. Given probability measures $\{P_n\}$ and $P$ on $\mathcal{A}$ and a particular sequence $\{A_m\}$ in $\mathcal{A}$ such that $P(A_m) \to 0$, $m \to \infty$, we shall say that the sequence $\{P_n\}$ is weakly contiguous to $P$ at the sequence $\{A_m\}$ if

(1.1)  $\lim_{m\to\infty} \limsup_{n\to\infty} P_n(A_m) = 0.$

This is a weak variant of the classical notion of contiguity due to LeCam (1960) and used by Hájek and Šidák (1967), which requires that $P_n(A_n) \to 0$ for every sequence $\{A_n\}$ such that $P(A_n) \to 0$.

REMARK 1.1.  (i) Note that the above notion of weak contiguity is much weaker than, and implied by, equicontinuity of $\{P_n\}$ at $\emptyset$ (Kingman and Taylor (1966), p. 177), which requires that

(1.2)  $\lim_{m\to\infty} \sup_n P_n(A_m) = 0$

for every sequence $\{A_m\}$ decreasing to $\emptyset$. Indeed, the limit in (1.1) is less than or equal to that in (1.2), we do not require that $A_m \downarrow \emptyset$, and we permit restriction to a single sequence $\{A_m\}$.

(ii) Also, the above notion of weak contiguity is much weaker than, and implied by, uniform absolute continuity of $\{P_n\}$ with respect to $P$, which requires that for every $\varepsilon > 0$ there exists $\delta > 0$ such that $P(A) < \delta$ implies $\sup_n P_n(A) < \varepsilon$.

(iii) The above weak notion of contiguity is relevant to the problem of convergence of $g(\xi_n)$ to $g(\xi)$, because our treatment will require merely that (1.1) hold for a single rather special sequence $\{B_m\}$ defined in terms of the given function $g$ and the limit measure $P$ through an application of Luzin's theorem, as will be seen below.

(iv) A further motivation for restricting to contiguity at a single sequence arises from certain statistical applications, to be discussed in detail in Section 2, in which the measures $\{P_n\}$ are in fact empirical measures based on a collection of observations each having distribution $P$. In such a situation, it is not true that the sequence $\{P_n\}$ is with probability 1 contiguous to $P$, or equicontinuous at $\emptyset$, or uniformly absolutely continuous with respect to $P$. However, it is easily seen that if the empirical measures $\{P_n\}$ (and here we are allowing consideration of empirical measures not necessarily the classical versions) satisfy a set-wise strong law (i.e., for each $A \in \mathcal{A}$, with probability 1 $\{P_n\}$ satisfies $P_n(A) \to P(A)$, $n \to \infty$), then for any given sequence $\{A_m\}$ with $P(A_m) \to 0$, the sequence $\{P_n\}$ satisfies (1.1) with probability 1. (Indeed, since the sequence $\{A_m\}$ is countable, with probability 1 we have $P_n(A_m) \to P(A_m)$, $n \to \infty$, simultaneously for all $m$, so that $\limsup_{n} P_n(A_m) = P(A_m) \to 0$, $m \to \infty$.) This "set-wise strong law property" is usually easily established for reasonable notions of empirical measure.  □
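The contrast drawn in Remark 1.1 (iv) can be checked numerically. The following is a minimal sketch under assumptions that are not part of the original text: $P$ is taken to be the Uniform(0,1) distribution, $P_n$ the classical empirical measure, and $A_m = [0, 1/m)$; the function names `empirical_measure` and `demo` are hypothetical. For fixed sets $A_m$ the empirical mass should track $P(A_m) = 1/m$, whereas the data-dependent set consisting of the observed points is $P$-null yet carries all of the empirical mass, which is why uniform absolute continuity fails.

```python
import random


def empirical_measure(sample, indicator):
    """Empirical measure P_n(A): fraction of sample points lying in A."""
    return sum(1 for x in sample if indicator(x)) / len(sample)


def demo(n=20000, ms=(2, 10, 50), seed=0):
    rng = random.Random(seed)
    # Assumed model: i.i.d. Uniform(0,1) observations, so P(A_m) = 1/m.
    sample = [rng.random() for _ in range(n)]
    # Fixed sets A_m = [0, 1/m): P_n(A_m) tracks P(A_m), as a set-wise
    # strong law predicts.
    fixed = {m: empirical_measure(sample, lambda x, m=m: x < 1.0 / m) for m in ms}
    # Data-dependent set: the observed points themselves. It is P-null,
    # yet it carries all of the empirical mass.
    observed = set(sample)
    data_dependent = empirical_measure(sample, lambda x: x in observed)
    return fixed, data_dependent


if __name__ == "__main__":
    fixed, data_dependent = demo()
    for m, value in fixed.items():
        print(f"P_n(A_{m}) = {value:.4f}   P(A_{m}) = {1.0 / m:.4f}")
    print(f"P_n(observed points) = {data_dependent}")
```

A finite simulation cannot verify an almost-sure limit; it only shows the two behaviours side by side for one sample size.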

Now take $(X,\mathcal{A}) = (S,\mathcal{S})$, where $(S,\rho)$ is a separable and locally compact metric space and $\mathcal{S}$ is the Borel $\sigma$-algebra generated by the open sets. We specify as follows the sequences $\{A_m\}$ in $\mathcal{S}$ which will arise for consideration in our development. Let $g: S \to \mathbb{R}$ be $\mathcal{S}$-measurable. By separability of $S$ the measure $P$ on $\mathcal{S}$ is regular, so that by local compactness of $S$ we may apply Luzin's theorem (Halmos (1950), p. 242; Cohn (1980), p. 227), which implies the existence of continuous functions $g_m: S \to \mathbb{R}$ and sets $B_m$ such that $P(B_m) \to 0$, $m \to \infty$, with $S - B_m$ compact and $g = g_m$ on $S - B_m$ ($m = 1,2,\ldots$). We shall call $\{g_m,B_m\}$ a Luzin sequence for $g$ with respect to $P$.
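A concrete Luzin sequence may help fix ideas. The sketch below is an illustration only, with every specific choice an assumption not taken from the text: $S = \mathbb{R}$, $g$ the indicator of $[0,\infty)$, $B_m = (-1/m, 0) \cup \{x: |x| > m\}$, $g_m(x) = \min(1, \max(0, 1 + mx))$, and $P$ the standard normal law (used only to evaluate $P(B_m)$). Here $S - B_m = [-m,-1/m] \cup [0,m]$ is compact, $g_m$ is continuous, and $g = g_m$ off $B_m$; since $B_m \downarrow \emptyset$, one has $P(B_m) \to 0$ for any probability measure $P$ on $\mathbb{R}$, including one with an atom at the jump point 0.

```python
import math


def g(x):
    """A Borel function with a jump at 0: the indicator of [0, infinity)."""
    return 1.0 if x >= 0 else 0.0


def g_m(x, m):
    """Continuous function equal to g off B_m (linear on [-1/m, 0])."""
    return min(1.0, max(0.0, 1.0 + m * x))


def in_b_m(x, m):
    """Membership in B_m = (-1/m, 0) union {|x| > m}; its complement is compact."""
    return (-1.0 / m < x < 0.0) or abs(x) > m


def std_normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def p_b_m(m):
    """P(B_m) under the assumed limit law P = N(0, 1)."""
    near_jump = std_normal_cdf(0.0) - std_normal_cdf(-1.0 / m)
    tails = 2.0 * (1.0 - std_normal_cdf(m))
    return near_jump + tails


if __name__ == "__main__":
    for m in (1, 2, 10, 100):
        print(f"m = {m:3d}   P(B_m) = {p_b_m(m):.5f}")
```

Luzin's theorem guarantees such a sequence for every Borel-measurable $g$; an explicit formula as above is available only in simple cases.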

Our main results and applications will be presented in Section 2. One application will concern Lemma 2.1 of van Zwet (1980), which in part provided the inspiration for this study. A broad class of statistics may be expressed in the form $T_n = \int_0^1 g\circ F_n^{-1}(t)\,J_n(t)\,dt$, where $J_n(\cdot)$, $n \ge 1$, are specified functions, $F_n(\cdot)$ is the usual empirical df based on an i.i.d. sample $X_1,\ldots,X_n$ from a df $F$, and $F_n^{-1}(t)$ denotes the sample $t$-quantile, $F_n^{-1}(t) = \inf\{x: F_n(x) \ge t\}$.

2.  A convergence theorem and some applications.  Without ado, we give

THEOREM 2.1.  Let $(S,\rho)$ be a separable and locally compact metric space and $\mathcal{S}$ the corresponding Borel $\sigma$-algebra, and let $\{P_n\}$ and $P$ be probability measures on $\mathcal{S}$. Let $g: S \to \mathbb{R}$ be $\mathcal{S}$-measurable and suppose that $\{P_n\}$ is weakly contiguous to $P$ at $\{B_m\}$, where $\{g_m,B_m\}$ is a Luzin sequence for $g$ with respect to $P$.

(i) Suppose that $P_n \Rightarrow P$. Then

(2.1)  $P_n g^{-1} \Rightarrow P g^{-1}.$

(ii) Let $\{\xi_n\}$ and $\xi$ be measurable mappings from a probability space $(\Omega,\mathcal{A},\mathbb{P})$ into $S$ with corresponding measures $\{P_n\}$ and $P$, and suppose that $\xi_n \xrightarrow{p} \xi$. Then

(2.2)  $g(\xi_n) \xrightarrow{p} g(\xi).$

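PROOF.  We first prove (ii), which then will be utilized in the proof of (i). Let $\varepsilon > 0$ be given, choose an integer $m \ge 1$, and write

$\mathbb{P}\{|g(\xi_n) - g(\xi)| > \varepsilon\} \le \alpha_{n,m} + \beta_{n,m} + \gamma_m,$

where $\alpha_{n,m} = \mathbb{P}\{|g_m(\xi_n) - g_m(\xi)| > \varepsilon,\ \xi_n \notin B_m,\ \xi \notin B_m\}$, $\beta_{n,m} = \mathbb{P}\{\xi_n \in B_m\} = P_n(B_m)$, and $\gamma_m = \mathbb{P}\{\xi \in B_m\} = P(B_m)$. (This is possible by the definitions and discussion of Section 1, since $g = g_m$ on $S - B_m$.) By the uniform continuity of $g_m$ on the compact set $S - B_m$, there exists $\delta_{m,\varepsilon} > 0$ such that $\rho(x,y) < \delta_{m,\varepsilon}$ implies $|g_m(x) - g_m(y)| < \varepsilon$ for $x,y \in S - B_m$. Hence $\alpha_{n,m} \le \mathbb{P}\{\rho(\xi_n,\xi) \ge \delta_{m,\varepsilon}\}$, and by $\xi_n \xrightarrow{p} \xi$ we thus have $\alpha_{n,m} \to 0$ as $n \to \infty$. Therefore,

(2.3)  $\limsup_{n\to\infty} \mathbb{P}\{|g(\xi_n) - g(\xi)| > \varepsilon\} \le \limsup_{n\to\infty} \beta_{n,m} + \gamma_m.$

We now let $m \to \infty$, applying (1.1) and the fact that $\gamma_m \to 0$, and obtain (2.2). Thus (ii) is proved.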

To prove (i), we apply a representation of Skorokhod (1956) as extended by Dudley (1968), to introduce a probability space $(\Omega_0,\mathcal{B}_0,P_0)$ and $\mathcal{B}_0$-$\mathcal{S}$-measurable $S$-valued functions $\{\eta_n\}$ and $\eta$ defined on $\Omega_0$ such that the distribution of $\eta_n$ is $P_n$ ($n = 1,2,\ldots$) and that of $\eta$ is $P$, and $\eta_n \to \eta$, $n \to \infty$, a.s. $[P_0]$. Here $\Omega_0 = \Omega \times \Omega^*$, where $\Omega^* = \prod_{n=1}^{\infty} \Omega_n$, with $\Omega_n = S_n \times I_n$, $S_n$ a copy of $S$, and $I_n$ a copy of the unit interval $[0,1]$. Thus $\Omega_0$ is separable and locally compact, by virtue of these properties for the spaces $S$ and $[0,1]$, and taking $\mathcal{B}_0$ to be the product $\sigma$-algebra on $\Omega_0$. Also, $\Omega_0$ is metrizable. Therefore, noting that $\eta_n \to \eta$ in $P_0$-measure, it is easily seen that we may apply part (ii) already proved, to assert that $g(\eta_n) \to g(\eta)$ in $P_0$-measure, which implies (2.1).  □
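The role of the weak contiguity hypothesis can be seen in a small, exactly computable example. It is constructed here for illustration and does not appear in the original text; all names and distributional choices are assumptions. Take $\xi = \max(U - 1/2, 0)$ with $U$ uniform on $(0,1)$, so that $\xi$ has an atom of mass $1/2$ at 0, let $\xi_n = \xi + 1/n$, and let $g$ be the indicator of $(0,\infty)$. A Luzin sequence is $g_m(x) = \min(1, \max(0, mx))$ with $B_m = (0,1/m) \cup \{x: |x| > m\}$. Then $\xi_n \to \xi$ surely and $P(B_m) \to 0$, but $\limsup_n P_n(B_m) = 1/2 + 1/m$ for $m \ge 2$, so (1.1) fails at this sequence, and correspondingly $g(\xi_n) = 1$ does not converge in probability to $g(\xi)$.

```python
from fractions import Fraction

HALF = Fraction(1, 2)


def cdf_xi(x):
    """Distribution function of xi = max(U - 1/2, 0), U uniform on (0, 1)."""
    if x < 0:
        return Fraction(0)
    return min(HALF + x, Fraction(1))


def p_of_b_m(m):
    """P(B_m) for B_m = (0, 1/m) union {|x| > m}, with integer m >= 1."""
    # The atom of xi at 0 is excluded from B_m; xi is uniform on (0, 1/2).
    return (cdf_xi(Fraction(1, m)) - cdf_xi(0)) + (1 - cdf_xi(m))


def p_n_of_b_m(n, m):
    """P_n(B_m), where P_n is the law of xi_n = xi + 1/n."""
    shift = Fraction(1, n)
    upper = Fraction(1, m) - shift
    # xi_n lies in (0, 1/m) iff xi < 1/m - 1/n; for n > m this event
    # contains the atom {xi = 0}.
    inner = cdf_xi(upper) if upper > 0 else Fraction(0)
    tail = 1 - cdf_xi(m - shift)
    return inner + tail


def p_g_differs():
    """P{|g(xi_n) - g(xi)| > eps} for g = 1{x > 0}, 0 < eps < 1, any n."""
    # g(xi_n) = 1 always, while g(xi) = 0 exactly on the atom {xi = 0}.
    return cdf_xi(0)


if __name__ == "__main__":
    for m in (2, 10, 100):
        print(m, p_of_b_m(m), p_n_of_b_m(10**6, m))
    print("P{g(xi_n) != g(xi)} =", p_g_differs())
```

Exact rational arithmetic is used so that the quantities in (1.1) are computed without rounding.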

REMARK 2.1.  (i) It is possible to relax the assumption of separability in Theorem 2.1, at the expense of greatly complicating the formulation and tools. This entails replacing the Borel $\sigma$-algebra $\mathcal{S}$ with a suitable sub-$\sigma$-algebra, considering non-Borel measures, and utilizing a further extension of Skorokhod's theorem by Wichura (1970) to arbitrary metric spaces. For discussion of convergence in distribution and probability in this setting, see Gaenssler (1983), Chapter 3.

(ii) It would be of interest also to develop an analogous theorem for convergence of $g(\xi_n)$ to $g(\xi)$ almost surely, given $\xi_n \to \xi$ almost surely $[\mathbb{P}]$. However, a preliminary investigation suggests the need for a refinement of Luzin's theorem. This will be pursued elsewhere.

From Remark 1.1 (i), (ii) we immediately have

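COROLLARY 2.1.  Let $(S,\rho)$ be a separable and locally compact metric space, with corresponding Borel $\sigma$-algebra $\mathcal{S}$. Let $\{P_n\}$ and $P$ be probability measures on $\mathcal{S}$ such that either $\{P_n\}$ is equicontinuous at $\emptyset$ or $\{P_n\}$ is uniformly absolutely continuous with respect to $P$. Then, for every $\mathcal{S}$-measurable function $g: S \to \mathbb{R}$, we have:

(i) $P_n g^{-1} \Rightarrow P g^{-1}$, whenever $P_n \Rightarrow P$,

and

(ii) $g(\xi_n) \xrightarrow{p} g(\xi)$, whenever $\{\xi_n\}$ and $\xi$ are measurable mappings of some $(\Omega,\mathcal{A},\mathbb{P})$ into $S$, with corresponding measures $\{P_n\}$ and $P$, such that $\xi_n \xrightarrow{p} \xi$.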

The next result is of special interest in connection with problems such as the statistical application discussed at the end of Section 1. We denote by $\lambda$ the Lebesgue measure on $([0,1],\mathcal{B})$. For any df $G$ on $\mathbb{R}$, put $G^{-1}(t) = \inf\{x: G(x) \ge t\}$, $t \in (0,1)$.


COROLLARY 2.2.  Let $g: \mathbb{R} \to \mathbb{R}$ be Borel-measurable. Let $\{G_n\}$ and $G$ be df's on $\mathbb{R}$ such that $G_n \Rightarrow G$ and $\{G_n\}$ is weakly contiguous to $G$ at a Luzin sequence for $g$ with respect to $G$. Then $g\circ G_n^{-1}$ converges to $g\circ G^{-1}$ in Lebesgue measure, i.e., for every $\varepsilon > 0$,

(2.4)  $\lim_{n\to\infty} \lambda\{t: |g\circ G_n^{-1}(t) - g\circ G^{-1}(t)| > \varepsilon\} = 0.$

PROOF.  We apply Theorem 2.1 with $S = \mathbb{R}$ and $(\Omega,\mathcal{A},\mathbb{P}) = ([0,1],\mathcal{B},\lambda)$. Now, by Lemma 1.5.6 of Serfling (1980), the convergence $G_n \Rightarrow G$ implies that the set $\{t: G_n^{-1}(t) \not\to G^{-1}(t),\ n \to \infty\}$ has at most countably many elements. That is, the sequence of r.v.'s $\xi_n(t) = G_n^{-1}(t)$ defined on $([0,1],\mathcal{B})$ converges a.s. $[\lambda]$ and hence in $\lambda$-measure to the r.v. $\xi(t) = G^{-1}(t)$. And, of course, the df's of $\{\xi_n\}$ and $\xi$ are simply $\{G_n\}$ and $G$, respectively. Thus (2.4) follows by part (ii) of Theorem 2.1.  □

Statistical applications of Corollary 2.2 are as follows. Let $X_1, X_2, \ldots$ be independent r.v.'s on $\mathbb{R}$ with common df $F$, and define $F_n(x) = n^{-1}\sum_{i=1}^{n} I(X_i \le x)$, $x \in \mathbb{R}$. By the Glivenko-Cantelli theorem it follows that with probability 1 $\{F_n(\cdot)\}$ is a sequence of df's satisfying $F_n \Rightarrow F$. With this fact and Remark 1.1 (iv), it follows that with probability 1 $\{F_n(\cdot)\}$ is a sequence of df's satisfying the requirements of Corollary 2.2 with $G_n = F_n$ and $G = F$. Thus, for any Borel-measurable $g: \mathbb{R} \to \mathbb{R}$, we have: with probability 1, $g\circ F_n^{-1}$ converges to $g\circ F^{-1}$ in Lebesgue measure, i.e., $\lambda\{t: |g\circ F_n^{-1}(t) - g\circ F^{-1}(t)| > \varepsilon\} \to 0$, each $\varepsilon > 0$. This result is precisely (with different notation) Lemma 2.1 of van Zwet (1980), which he employs instrumentally in establishing strong convergence of statistics of the form $T_n = \int_0^1 g\circ F_n^{-1}(t)\,J_n(t)\,dt$, under suitable conditions on $\{J_n\}$ and $F$. Extensions of this work to a much broader class of statistics are made possible by the generalization of van Zwet's lemma given by Theorem 2.1 (ii); see Helmers, Janssen and Serfling (1985).
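The convergence in Lebesgue measure asserted above can be illustrated by simulation. The sketch below rests on assumptions chosen for this illustration only: $F$ is the df of $\max(U - 1/2, 0)$ with $U$ uniform on $(0,1)$ (an atom of mass $1/2$ at 0), $g$ is the indicator of $(0,\infty)$, which jumps exactly at that atom so that the usual a.s.-continuity condition on $g$ fails, and the Lebesgue measure is approximated on a grid of midpoints. The function names are hypothetical.

```python
import math
import random


def sample_quantile(sorted_sample, t):
    """F_n^{-1}(t) = inf{x: F_n(x) >= t}: the ceil(n t)-th order statistic."""
    n = len(sorted_sample)
    return sorted_sample[max(math.ceil(n * t), 1) - 1]


def true_quantile(t):
    """F^{-1}(t) for the assumed F: mass 1/2 at 0, uniform on (0, 1/2)."""
    return 0.0 if t <= 0.5 else t - 0.5


def g(x):
    """Indicator of (0, infinity); it jumps exactly at the atom of F."""
    return 1.0 if x > 0 else 0.0


def discrepancy(n, eps=0.5, grid=10000, seed=0):
    """Grid approximation to lambda{t: |g(F_n^{-1}(t)) - g(F^{-1}(t))| > eps}.

    Also returns the fraction of observations equal to 0.
    """
    rng = random.Random(seed)
    sample = sorted(max(rng.random() - 0.5, 0.0) for _ in range(n))
    zero_fraction = sum(1 for x in sample if x == 0.0) / n
    bad = 0
    for k in range(grid):
        t = (k + 0.5) / grid  # midpoints of a uniform partition of (0, 1)
        if abs(g(sample_quantile(sample, t)) - g(true_quantile(t))) > eps:
            bad += 1
    return bad / grid, zero_fraction


if __name__ == "__main__":
    for n in (100, 1000, 10000):
        disc, zero_fraction = discrepancy(n)
        print(f"n = {n:5d}   discrepancy = {disc:.4f}   zeros/n = {zero_fraction:.4f}")
```

In this example the two step functions disagree exactly for $t$ between $K_n/n$ and $1/2$, where $K_n$ is the number of observations equal to 0, so the discrepancy is $|K_n/n - 1/2|$ up to grid error and tends to 0 by the strong law.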